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Sorting is a fundamental computational process, which fa- 
cilitates subsequent searching of a database. It can be thought 
of as factorisation of the search process. The location of a 
desired item in a sorted database can be found by classical 
queries that inspect one letter of the label at a time. For an 
unsorted database, no such classical quick search algorithm is 
available. If the database permits quantum queries, however, 
then mere digitisation is sufficient for efficient search. Sorting 
becomes redundant with the quantum superposition of states. 
A quantum algorithm is written down which locates the de- 
sired item in an unsorted database a factor of two faster than 
the best classical algorithm can in a sorted database. This 
algorithm has close resemblance to the assembly process in 
DNA replication. 

PACS: 03.67.Lx, 87.15.By 



Consider a collection of items in a database, charac- 
terised by some property with values on the real line. 
Sorting these items means arranging them in an ordered 
sequence according to their property values. Though 
property values along the real line are sufficient for estab- 
lishing the sequence, the values are taken to be distinct, 
and without loss of generality they can be replaced by 
integer labels. The integer labels can be easily digitised, 
i.e. written as a string of letters belonging to a finite 
alphabet. When the alphabet has a letters, a string of n 
letters can label N — a n items. (If the number of items 
in the database is not a power of a, then the database is 
padded up with extra labels to make N — a n .) In digital 
computers, this finite alphabet has the smallest size, i.e. 
a = 2, and the letters are called bits. 

The purpose of sorting is to facilitate subsequent 
searches of that database for items with known values 
of the property. Though digitisation does not change 
the order of the items in the sequence, it simplifies the 
search process. One does not look for the complete string 
of letters in one go, but sequentially looks for one letter 
at a time. A dictionary is a simple example of a sorted 
database with alphabetical ordering. With digitisation, 
the individual search steps have to distinguish amongst 
only a limited number of possibilities, and the maximum 
simplification of the search steps occurs when the alpha- 
bet has the smallest size, i.e. a = 2. For a database of size 
N, sorting requires 0(N \og 2 N) operations |Q. Once a 
database is sorted, all subsequent searches to locate any 
item in it require only 0(log 2 N) queries. Search for a 



desired item in an unsorted database would take 0(N/2) 
queries, provided that the search process has a memory 
so that an item rejected once is not picked up again for 
inspection. This reduction in search effort is what makes 
the laborious process of sorting worthwhile, to be carried 
out once and for all. 

The speed-up achieved by sorting can be understood 
as factorisation of the search process. An alphabet with 
a finite number of letters can be looked upon as a group 
of finite order, and any function defined on such a group 
is expressible as a polynomial. A function defined with 
multiple letters becomes a multivariable polynomial. A 
single variable polynomial can always be fully factored 
over the domain of complex numbers, but a multivari- 
able polynomial may not be. When a polynomial can be 
factored, it is much more efficient to evaluate it by mul- 
tiplying all the factors together than by calculating it 
term by term. The search function is such that it can be 
fully factored, resulting in a speed-up of 0(N/ log 2 N). 
As a simple case consider a database of N = 2™ items. 
Suppose the desired item has the label 



X = X1X2X3X4 



= 1011... 



(1) 



Any function of x reduces to a polynomial in xi because 
Xi € {0, 1} implies xf — Xi. The search process is equiv- 
alent to finding x such that 

f(x) = Y\_ fi(Xi) = xii 1 ~ X2)x 3 X4 ... (1 - x n ) (2) 

i 

equals 1. f(x) evaluates to zero for all the other items in 
the database. In the sorted and digitised database, one 
searches for an item by inspecting only one Xi at a time, 
i.e. evaluating f[x) by sequentially combining its factors 

In the language of algorithms, the function f(x) is re- 
ferred to as an oracle. The capability of evaluating fix) 
for any x is assigned to a black box, and the efficiency of 
algorithms is judged by the number of times the oracle 
is queried. To take advantage of a sorted database, it 
is crucial to have the factorised oracles, f%{xi), available. 
If only the oracle f[x) is available, then the search pro- 
cess requires 0(N/2) queries even with a sorted database. 
The collection of factorised oracles fi(xi) is more power- 
ful than the global oracle f(x) — one can construct f(x) 
by combining all the fi(xi) together, but the individual 
fi(xi) cannot be determined given only f(x). Generally, 
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how an oracle is physically implemented is not of concern 
in the complexity analysis of an algorithm. But it is clear 
that fiixi) are easier to physically implement than f(x), 
because X4 span a much smaller range of values than x. 

With a single query, the factorised oracles are able to 
characterise a group of items (e.g. all the items with 1 as 
the first bit of the label), unlike the global oracle that can 
characterise only a single item. With a sorted database, 
one takes advantage of this property of factorised ora- 
cles. The items that are addressed together by factorised 
oracles are arranged together in a sorted database, and 
not left scattered all over the place. The factorised ora- 
cles then easily pick up only the relevant group of items, 
and implement the search process one letter at a time, 
whereby the size of the group reduces in a geometric pro- 
gression. If the items are not grouped together and left 
scattered all over, the factorised oracles do not offer any 
advantage compared to the global oracle. 

Classically, the only way to group items together is to 
arrange them close to each other, as in a sorted database. 
Quantum physics provides another mechanism for group- 
ing items together quantum superposition of states. Su- 
perposed states are grouped together in the sense that 
they all correspond to the same physical location (in 
space as well as in time), and they all can be simulta- 
neously acted upon by a single physical transformation. 
In a quantum search process, the inspection of items need 
not be carried out picking one item at a time from the 
database; instead the search oracles can be applied di- 
rectly to groups of items superposed together. In what 
follows, I demonstrate that when factorised oracles are 
available, quantum superposition can be used to make 
sorting redundant. 

The quantum parallelism strategy, based on superpo- 
sition of states, is quite general. It makes the algorithms 
more efficient, and it has been exploited to convert clas- 
sically superpolynomial problems into quantum polyno- 
mial ones. Because of linearity of quantum mechan- 
ics, when the initial state is a quantum superposition of 
states, the final state after carrying out the algorithmic 
operations will be a quantum superposition of the cor- 
responding outcomes. Of course, at the end, quantum 
measurement does not permit individual determination 
of each outcome. But if only a single property of the 
possible outcomes is desired, then it may be possible to 
pick it up by a cleverly designed measurement (perhaps 
involving interference of the possible outcomes) . The net 
result is a substantial speed-up of the algorithm. 

Quantum algorithms need to obey an important prac- 
tical constraint, i.e. all their communications with the 
outside world must have classical representations. These 
communications include input, output and oracles. This 
constraint arises from our own limitations as classical ob- 
servers. Our problems are defined in a classical language, 
and we do not know how to make sense out of them other- 
wise. So we always have to find ways to extract classical 



meanings from quantum superpositions. 

To take maximal advantage of quantum parallelism, 
many quantum algorithms start with a uniform quantum 
superposition of all possible classical input states. Such a 
state can be easily created from a single classical state by 
a fully factorised operation. For instance, each qubit in 
an n— qubit string can be put into an equal superposition 
of its two eigenstates, by starting with a zero-string and 
applying the Walsh-Hadamard transformation: 
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Digitisation of the state labels is implicit but essential 
in this process. It reduces the spatial resources needed 
to solve the problem, i.e. 2™ states are encoded using 
only n qubits. An obvious feature of superposition is 
that it is commutative — the order in which the states are 
superposed is irrelevant. This is the property that makes 
sorting redundant whenever the states are combined in a 
superposition. 

Grover's quantum database search algorithm || starts 
with a maximally superposed state, uses a binary search 
function and reaches a single state at the end. Thus the 
input, the output and the oracle have classical represen- 
tations as desired. Grover's algorithm is directly applied 
to an unsorted database. The unitary quantum opera- 
tors used are the search function F (which evaluates to 
— 1 when the labels match and to +1 when they do not) 
and an operation R that reflects all the amplitudes about 
their average. Alternate application of these two opera- 
tors takes the maximally superposed state to the desired 
state. 
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Compared to classical unsorted database search, quan- 
tum parallelism provides a quadratic speed-up and the 
desired item is located with 0(\/N) queries. This rate, 
however, is much slower than that for the classical sorted 
database search, and we can look for ways to boost it. 

Though sorting has become redundant, Grover's algo- 
rithm can still be factorised. Factorisation of the search 
function requires digitisation of the database, so let us 
start with a digitised database. (There is no hindrance 
to digitising a database, even if it is unsorted. All that 
is required is that the items be distinct from each other.) 
Digitisation makes it possible to look at one letter of the 
label at a time. The search process will inspect each let- 
ter of the label in turn, and decide whether it matches the 
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corresponding letter of the desired string or not. Since 
Grover's algorithm requires only one query to pick one 
item out of four with certainty it is efficient to digitise 
the quantum database with a = 4. At the i th step of 
the algorithm, let Fi be the single-letter search function 
which looks at the i th letter of the label. Fi will evaluate 
to —1 when the letters match and to +1 when they do 
not. 



Fi = h 



{±1}< 



(6) 



The reflection operation Ri that goes along with Fi can 
also be written in a factorised form. 



Ri — I\ 




In ■ (7) 



When Ri follows Fi, it carries out the same task as 
R, i.e. reflects all the amplitudes about their average. 
Since 4 is a power of 2, the non-trivial transformation 
matrix in Ri can be decomposed in qubit notation as 
(H <g> H)Fq(H ® H), where Fq and H follow a common 
qubit basis convention and Fq flips the sign of the state 
labeled 0. It is not mandatory however, that Ri be im- 
plemented in this manner. 

In addition, we require the projection/measurement 
operators Pi, for every letter of the label. Once the i th 
letter gets fixed by Pi, it cannot change during the sub- 
sequent steps of the algorithm, i.e. Pi eliminates from 
the quantum Hilbert space all the states whose i th letter 
does not match with the desired label. Then the fac- 
torised quantum search algorithm is: 



\x) = 



n N 
1=1 .7 = 1 V 



(8) 



The starting point is the uniform superposition of all the 
N states. Application of R\F\ doubles the amplitudes 
of all those strings whose first letter matches with the 
first letter of the desired item label, and reduces all the 
other amplitudes to zero. The projection/measurement 
operator Pi does not change any of the amplitudes, but 
removes from the Hilbert space all the states with zero 
amplitude. The intermediate result is then a uniform su- 
perposition over the reduced Hilbert space of dimension 
N/4, containing all the states whose first letter matches 
with the desired item label. One by one, the algorithm 
goes through all the letters of the string and ultimately 
finds the item with the desired label. 

In the classical search of a sorted database, the se- 
quence of i in which the oracles have to be queried is de- 
termined by the convention used for grouping the items 
together during the sorting process. For example, with 
the conventional labeling as in a dictionary, one has to 
first query the first letter using fi(x\), then query the 



second letter using f 2(^X2), and so on. For the factorised 
quantum algorithm, Eq.(8) above, there is no such re- 
quirement; the database is unsorted and the sequence of 
i during the execution of the algorithm does not matter. 

The use of Pi is essential here, so that the eliminated 
states do not take part in the subsequent steps of the al- 
gorithm, i.e. Ri acts only on the Hilbert space of dimen- 
sion 4 1 ~ t N. In a general quantum setting, the reduction 
of dimension of Hilbert space using Pj is a complicated 
operation. But the structure of the factorised search al- 
gorithm is such that all the states eliminated by Pi have 
zero amplitudes. For this limited purpose, Pi can be de- 
noted using identity operators, 



Pi = h 



(-fi)fix ® • • • ® In 



(9) 



Introduction of Pi also provides an error correcting mech- 
anism. If the measured letter of the label does not match 
with the desired letter, due to imperfect implementation 
of Fi and Ri or due to effects of decoherence, then the 
algorithm can be restarted at the i th step, with a uni- 
form superposition of 4 1 ~ l A states. (Such a state can be 
prepared by setting (i — 1) letters to their correctly mea- 
sured values, the remaining (n — i + 1) letters to zero, and 
then applying Walsh-Hadamard transformation to these 
(n — i + 1) letters.) 

The transformations Fi, Ri and P, act on a single let- 
ter of the label only. The equations above show that the 
non-trivial operations are performed only on the i th let- 
ter, while the rest of the letters remain unchanged (iden- 
tity operations). Clearly the resources required to im- 
plement such operations depend only on the size of the 
alphabet a, and are independent of the length of the la- 
bel n. The physical resources needed to implement the 
factorised quantum search algorithm — number of letters, 
number of queries and number of algorithmic steps — are 
therefore all proportional to n. 

We have thus arrived at the first important result of 
this article: The factorised quantum search algorithm 
locates the desired item in an unsorted database using 
0(log 4 N) queries, which is a factor of two improve- 
ment over the best search algorithm for a classical sorted 
database. (If the algorithm has to be restarted to correct 
errors, as mentioned above, then the improvement factor 
will be smaller.) 

The frequent intervention of projection/measurement 
operators in this factorised algorithm makes it look like a 
quantum process that is stabilised by intermediate clas- 
sical states. Indeed, we can think of a process closely 
related to search — assembling a desired string by putting 
together a sequence of letters. The letters have to be 
available in a jumbled up database and an oracle has to 
exist dictating the order in which the letters are to be put 
together. The algorithm will then carry out the assembly 
line operation by adding one letter at a time. At each 
step, one out of a items is selected, and the full string 
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is constructed by putting together n items. The quan- 
tum assembly algorithm requires a superposition of only 
a states at each step, and not N states as in Grover's 
algorithm; this reduction in the number of superposed 
states should make it easier to implement in reality. The 
physical resources necessary to implement the quantum 
assembly algorithm are again proportional to n, and in 
terms of the number of queries made, the quantum as- 
sembly algorithm provides a factor of two speed-up com- 
pared to the best classical assembly algorithm. 

It is too tempting to overlook a fundamental biologi- 
cal process that works in this manner, i.e. replication of 
DNA. (As a matter of fact, biochemistry is full of assem- 
bly processes which synthesise desired objects out of their 
components by pattern recognition oracles.) The DNA 
alphabet has four letters, i.e. the bases A,T,C,G. In DNA 
itself these bases are linked together as a string, but in- 
dividual bases also randomly float around in the cellular 
environment. During replication the two strands of the 
DNA double helix separate. Each intact strand dupli- 
cates itself in an assembly line operation, picking up one 
base at a time from the cellular environment. The intact 
strand is the factorised oracle, with the complementar- 
ity of the bases acting as the search function. The fact 
that this process takes place at the molecular scale and 
uses an alphabet of four letters raises a highly provoca- 
tive thought. Could it be that the evolution of life sensed 
the advantage of a quantum algorithm, and opted to or- 
ganise the genetic information in DNA using four bases? 
Note that classically just two bases (one complementary 
pair) are sufficient to carry the genetic information. This 
question has been discussed elsewhere || . 

Finally, it is worth noting that even though factorisa- 
tion is a classical computational procedure, when com- 
bined with quantum parallelism, it has proved to be use- 
ful in constructing efficient quantum algorithms. Under 
the best circumstances, factorisation can provide a speed- 
up of 0(N/ log 2 N), and so can quantum parallelism. 
Shor's efficient quantum solution to the period finding 
problem Q combines factorisation (FFT is the factorised 
version of Fourier transform) and quantum parallelism. 
Each provides a gain of 0(N / log 2 N), and the final al- 
gorithm requires 0((log 2 TV) 2 ) steps. In the search algo- 
rithm, gains of factorisation (i.e. sorting) and quantum 
parallelism overlap. So one cannot obtain a speed-up 
of 0(N/ \og 2 N) twice, and the quantum factorised al- 
gorithm provides only a factor of two speed-up over the 
classical factorised algorithm. Yet another problem, that 
of finding parity of a set of bits, cannot gain anything 
from factorisation and very little from quantum paral- 
lelism; the best quantum algorithm does not improve the 
classical one by much (both are O(N)) ||. 
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